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Amendments to the Claims : 

This listing of claims will replace all prior versions, and listings, of claims in the application: 

1 . (Currently Amended) A method for providing captioning in an AV signal, 
the method comprising: 

selecting a number of lines of caption data which can be displayed at 

one time; 

automatically identifying a voice and speech pattern in an audio signal 
from a plurality of voice and speech patterns with the speech-to-text processing system; 

training a speech-to-text processing system to learn one or more new 
words in the audio signal; 

converting an directly translating the audio signal in the AV signal to 
caption data automatically with the speech-to-text processing system, wherein the direct 
translation is adjusted by the speech-to-text processing system based on the training and the 
identification of the voice and speech pattern; and 

associating the caption data with the AV signal at a time substantially 
corresponding to a video signal associated with the converted audio signal in the AV signal 
from which the caption data was directly translated with the speech-to-text processing 
system, wherein the associating further comprises synchronizing the caption data with one or 
more cues in the AV signal ; and 

displaying the AV signal with the caption data at the time substantially 
corresponding with the converted audio signal in the AV signal, wherein the number of lines 
of caption data which is displayed is based on the selection. 

2. (Original) The method as set forth in claim 1 further comprising: 

capturing the AV signal, the AV signal comprising the audio signal 
and the video signal; and 

providing the audio signal in the AV signal for the converting. 

3. (Original) The method as set forth in claim 1 wherein the converting 
further comprises: 

determining a first amount of data in the caption data; and 
providing the caption data for the associating when the first amount is 
greater than a threshold amount or when a first period of time has expired. 
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4. (Cancelled) 

5. (Original) The method as set forth in claim 1 wherein the associating 
further comprises embedding the caption data within the AV signal. 

6. (Original) The method as set forth in claim 1 further comprising displaying 
at least the video signal and the associated caption data. 

7. (Original) The method as set forth in claim 1 further comprising storing on 
at least one recordable medium at least the video signal and the associated caption data. 

8. (Original) The method as set forth in claim 1 wherein the converting 
further comprises translating at least one word in a first language of the audio signal into a 
second language, the at least one translated word included in the caption data. 

9. (Currently Amended) A speech signal processing system, the system 

comprising: 

a speech-to-text processing system that selects a number of lines of 
caption data which can be displayed at one time, automatically identifies a voice and speech 
pattern in an audio signal from a plurality of voice and speech patterns, trains to learn one or 
more new words in the audio signal, and directly translates conv e rts an audio signal in an A V 
signal to caption data based on the training and the identification of the voice and speech 
pattern; and 

a signal combination processing system that associates the caption data 
with the AV signal at a time substantially corresponding to a vid e o signal associated with the 
converted audio signal in the AV signal from which the caption data was directly translated 
with the speech-to-text processing system, wherein the signal combination processing system 
synchronizes the caption data with one or more cues in the AV signal ; and 

a display system that displays the AV signal with the caption data at 
the time substantially corresponding with the converted audio signal in the AV signal, 
wherein the number of lines of caption data which is displayed is based on the selection. 
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10. (Original) The system as set forth in claim 9 further comprising a source 
for providing the audio signal in the AV signal to the speech-to-text processing system. 

1 1 . (Original) The system as set forth in claim 9 wherein the speech-to-text 
processing system further comprises a counter that determines a first amount of data in the 
caption data and a timer that determines when a first period of time has expired, wherein the 
speech-to-text processing system providing the caption data for the associating when the first 
amount is greater than a threshold amount or when timer indicates that the first period of time 
has expired. 

12. (Cancelled) 

13. (Original) The system as set forth in claim 9 wherein the signal 
combination processing system embeds the caption data within the AV signal. 

14. (Original) The system as set forth in claim 9 further comprising a video 
monitor for displaying at least the video signal and the associated caption data. 

15. (Original) The system as set forth in claim 9 further comprising a 
translator that translates at least one word in a first language of the audio signal into a second 
language, the at least one translated word included in the caption data, the translating device 
coupled to the speech recognition processor. 

16. (Previously Presented) The system as set forth in claim 9 wherein the 
signal separation processing system, the speech-to-text processing system and the signal 
combination processing system are integrated within a device, the device being portable and 
usable in a classroom environment. 

17. (Currently Amended) A computer readable medium having stored 
thereon instructions for providing captioning which when executed by at least one processor, 
causes the processor to perform steps comprising: 

selecting a number of lines of caption data which can be displayed at 

one time; 
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identifying a voice and speech pattern in an audio signal from a 
plurality of voice and speech patterns with the speech-to-text processing system; 

training a speech-to-text processing system to learn one or more new 
words in the audio signal; 

converting an directly translating the audio signal in the AV signal to 
caption data automatically with the speech-to-text processing system, wherein the direct 
translation is adjusted by the speech-to-text processing system based on the training and the 
identification of the voice and speech pattern; and 

associating the caption data with the AV signal at a time substantially 
corresponding to a vid e o signal associated with the converted audio signal in the AV signal 
from which the caption data was directly translated with the speech-to-text processing 
system, wherein the associating further comprises synchronizing the caption data with one or 
more cues in the AV signal ; and 

displaying the AV signal with the caption data at the time substantially 
corresponding with the converted audio signal in the AV signal, wherein the number of lines 
of caption data which is displayed is based on the selection. 

18. (Original) The medium as set forth in claim 17 further comprising: 

capturing the AV signal, the AV signal comprising the audio signal 
and the video signal; and 

providing the audio signal in the AV signal for the converting. 

19. (Original) The medium as set forth in claim 17 wherein the converting 
further comprises: 

determining a first amount of data in the caption data; and 
providing the caption data for the associating when the first amount is 
greater than a threshold amount or when a first period of time has expired. 



20. (Cancelled) 



21 . (Original) The medium as set forth in claim 1 7 wherein the associating 
further comprises embedding the caption data within the AV signal. 
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22. (Original) The medium as set forth in claim 17 further comprising 
displaying at least the video signal and the associated caption data. 

23. (Original) The medium as set forth in claim 17 further comprising storing 
on at least one recordable medium at least the video signal and the associated caption data. 

24. (Original) The medium as set forth in claim 17 wherein the converting 
further comprises translating at least one word in a first language of the audio signal into a 
second language, the at least one translated word included in the caption data. 

25. (New) The method as set forth in claim 1 further comprising: 

determining a type of the encoder being used with the speech-to-text 
processing system; and 

retrieving settings for the speech-to-text processing system to 
communicate with the encoder based on the identification of the encoder. 



26. (New) The system as set forth in claim 9 wherein the speech-to-text 
processing system determining a type of the signal combination processing system being used 
and retrieves settings for the speech-to-text processing system to communicate with the signal 
combination processing system based on the identification of the signal combination 
processing system. 

27. (New) The medium as set forth in claim 17 further comprising: 

determining a type of the encoder being used with the speech-to-text 
processing system; and 

retrieving settings for the speech-to-text processing system to 
communicate with the encoder based on the identification of the encoder. 



